Appropriate Item Partition for Improving the Mining Performance
نویسندگان
چکیده
Along with the progress of information techniques and the increase of information need, some databases in the real world grow very quickly and their sizes become very huge. If the FP-Growth procedure is directly executed on these databases to mine association rules, the computer memory may not allow all nodes of a FP-tree generated from a huge database. In this paper, a sophisticated mining approach with a flexible partition of items is proposed to effectively derive association rules under the constraint of memory limitation. The experimental results show that the proposed approach can make the mining process under the memory limitation always feasible.
منابع مشابه
An Efficient Technique for Indexing Temporal Databases
Temporal databases added a new dimension to traditional transaction databases. This dimension is the life time of each item, i.e. exhibition period, starting from the partition when this item appears in the transaction database to the partition when this item no longer exists. Mining temporal association rules became very interesting topic in many applications nowadays. In this paper, an effici...
متن کاملPrediction of effect of fine particle removal on efficiency of a spiral circuit by size-by-size partition curves
Partition curves are widely used to determine the spiral separator efficiency. In this work, the partition curves were used in order to investigate the particle transportation to concentrate and tailing streams. Simulation of fine particle removal using the size-by-size partition curves showed that the recovery of gangue particles to concentrate can decrease 8.7%. It also showed that the recove...
متن کاملUse of Semantic Similarity and Web Usage Mining to Alleviate the Drawbacks of User-Based Collaborative Filtering Recommender Systems
One of the most famous methods for recommendation is user-based Collaborative Filtering (CF). This system compares active user’s items rating with historical rating records of other users to find similar users and recommending items which seems interesting to these similar users and have not been rated by the active user. As a way of computing recommendations, the ultimate goal of the user-ba...
متن کاملFrequent Pattern Mining using Candidate Generation approach with Single Scan of Database
Most of the algorithms for discovering association rules require multiple passes over the database resulting in a large number of disk reads and placing a huge burden on the I/O subsystem [1]. To reduce this bottleneck in case of large databases, a new association rule mining algorithm, which uses both the Partition and the Apriori approach for calculating the frequent item sets in a single pas...
متن کاملExamining the Performance of Vertical Fragmentation using FP-MAX Algorithm
Today’s business Environment has an increasing need for consistent, scalable, reliable and accessible information which grows steadily.The purpose of this work is to analyse the performance of Vertical Fragmentation on large as well as small database such as educational database, data warehouses, medical databases. Vertical Fragmentation has an important impact in improving the performance of m...
متن کامل